SEQUENCE LISTING

<110> Diversa Corporation 
K. Gray 
J. Garrett 
N. Aboushadi
R. Knowles
E. O'Donoghue
E. Waters

<120> GLUCOSIDASES, NUCLEIC ACIDS ENCODING 
THEM AND METHODS FOR MAKING AND USING THEM 

<130> 564462004040 

<150> 60/456,972 
<151> 2003-03-20 

<160> 24 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1710 
<212> DNA 
<213> Bacteria 

<400> 1 

atgcagcgcc atcgcagaag gtgcagggct aagcttgtcg ggttcgtttt ggcaccccgt 60 

ttggcgggtg catggaagcc cggagggggg ccctcgatgt cgcagactcc atggtggcgc 120 

ggtgccgtca tctaccagat ctacccgcgc agtttcctcg acgccaacgg cgacggggtc 180 

ggtgacctgc ccggcatcat cgaccggctg gagtacgtgg ccgcgctggg cgtggacgcc 240 

atctgggtct cgccgttctt cacctcgccg atggccgatt tcggctacga catcgccgac 300 

catcgcgacg tggacccgct gtttggcacg ctggccgatt tcgaccggct gctggccaag 360 

gcgcatgcgc tgggcctgaa ggtgatgatc gaccaggtgt tcagccacac ctcgatcgac 420 

cacgcctggt tccgtgagag ccggcaggac cgcaccaatc cgaaggcgga ctggtacgtg 480 

tgggccgacc cgcgcgagga cggcacgccg cccaacaact ggatgtcgat cttcggcggg 540 

gtggcctggc aatgggagcc gcgccgggag cagtacttcc tgcacaactt cctggccgac 600 

cagccggacc tggatttcca caacccggcg gtgcagcagg ccacgctgga ctacgtgcgc 660 

ttctggctgg accggggcgt ggacgggttc cgcctggact cgatcaactt ctgcttccac 720 

gacgcgcagt tgcgcgacaa cccggccaag ccgctggaaa agcgcgtcgg ccgtggcttc 780 

agcgcggaca atccgtacgc ctaccagtac cactactaca acaacaccca gccggagaac 840

atcggcttca tcgagcgcct gcgtgggttg ctggacgaat acccgggcac cgtgagcctg 900 

ggcgagatct cggccgagga ctcgctggcc accaccgccg agtacaccgc gccgggccgc 960 

ctgcacatgg gctacagctt cgagctgctg gtgaaggatt tcagcgccgg ctacatccgc 1020 

gacaccgtgt cgcggctgga agcgacgatg accgaaggct ggccgtgctg ggcgatctcc 1080 

aaccacgacg tggagcgtgc ggtcactcgc tggggcggcc atccggcccg gccgcggctg 1140

gcgcggatgc tggtggcgct gctgtgctcg ctgcgtggct cgatctgcct gtaccagggc 1200 

gaggagctgg gcctgggcga ggcggacgtg ccgttcgagg cgctgcagga cccgtatggc 1260

atcaccttct ggccgaactt caagggccgc gacggctgcc gcacgccgat gccgtggatc 1320 

gatgcgccgt tggcagggtt caccagcggt gagccgtggc tgccgattcc ggccgagcac 1380 

cgcgccgcgg cggtggcggt gcaggagcac gacccgcact cggtgttgaa cgcgttccgc 1440 

cagttcctgg catggcgcag gacgatgccg acgctgctgg tgggcgacat cgtcttcctg 1500 

cagacggccg agccggtgct gatgttcgag cgccggcatg cgggggagac gctgctgctg 1560 

gccttcaacc tggcggccga caccgcgcgc gtggcgctgc ccgccggcag ctggcagccg 1620 

atgcacgtgc cgggcccgga cgtgggccag gccgacggcg ggacgttggt actgccggcg 1680 

cagtcgatgt actgcgcgcg cctgggctga 1710 
<210> 2
<211> 569 
<212> PRT 
<213> Bacteria 

<220> 

<221> SIGNAL 
<222> (1)...(24)

<221> DOMAIN
<222> (46)...(434)

<223> Alpha amylase, catalytic domain 
<400> 2 

Met Gln Arg His Arg Arg Arg Cys Arg Ala Lys Leu Val Gly Phe Val
1 5 10 15

Leu Ala Pro Arg Leu Ala Gly Ala Trp Lys Pro Gly Gly Gly Pro Ser
20 25 30

Met Ser Gln Thr Pro Trp Trp Arg Gly Ala Val Ile Tyr Gln Ile Tyr
35 40 45

Pro Arg Ser Phe Leu Asp Ala Asn Gly Asp Gly Val Gly Asp Leu Pro
50 55 60

Gly Ile Ile Asp Arg Leu Glu Tyr Val Ala Ala Leu Gly Val Asp Ala
65 70 75 80

Ile Trp Val Ser Pro Phe Phe Thr Ser Pro Met Ala Asp Phe Gly Tyr
85 90 95

Asp Ile Ala Asp His Arg Asp Val Asp Pro Leu Phe Gly Thr Leu Ala
100 105 110

Asp Phe Asp Arg Leu Leu Ala Lys Ala His Ala Leu Gly Leu Lys Val
115 120 125

Met Ile Asp Gln Val Phe Ser His Thr Ser Ile Asp His Ala Trp Phe
130 135 140

Arg Glu Ser Arg Gln Asp Arg Thr Asn Pro Lys Ala Asp Trp Tyr Val
145 150 155 160

Trp Ala Asp Pro Arg Glu Asp Gly Thr Pro Pro Asn Asn Trp Met Ser
165 170 175

Ile Phe Gly Gly Val Ala Trp Gln Trp Glu Pro Arg Arg Glu Gln Tyr
180 185 190

Phe Leu His Asn Phe Leu Ala Asp Gln Pro Asp Leu Asp Phe His Asn
195 200 205

Pro Ala Val Gln Gln Ala Thr Leu Asp Tyr Val Arg Phe Trp Leu Asp
210 215 220

Arg Gly Val Asp Gly Phe Arg Leu Asp Ser Ile Asn Phe Cys Phe His
225 230 235 240

Asp Ala Gln Leu Arg Asp Asn Pro Ala Lys Pro Leu Glu Lys Arg Val
245 250 255

Gly Arg Gly Phe Ser Ala Asp Asn Pro Tyr Ala Tyr Gln Tyr His Tyr
260 265 270

Tyr Asn Asn Thr Gln Pro Glu Asn Ile Gly Phe Ile Glu Arg Leu Arg
275 280 285

Gly Leu Leu Asp Glu Tyr Pro Gly Thr Val Ser Leu Gly Glu Ile Ser
290 295 300

Ala Glu Asp Ser Leu Ala Thr Thr Ala Glu Tyr Thr Ala Pro Gly Arg
305 310 315 320

Leu His Met Gly Tyr Ser Phe Glu Leu Leu Val Lys Asp Phe Ser Ala
325 330 335

Gly Tyr Ile Arg Asp Thr Val Ser Arg Leu Glu Ala Thr Met Thr Glu
340 345 350

Gly Trp Pro Cys Trp Ala Ile Ser Asn His Asp Val Glu Arg Ala Val
355 360 365

Thr Arg Trp Gly Gly His Pro Ala Arg Pro Arg Leu Ala Arg Met Leu
370 375 380

Val Ala Leu Leu Cys Ser Leu Arg Gly Ser Ile Cys Leu Tyr Gln Gly
385 390 395 400

Glu Glu Leu Gly Leu Gly Glu Ala Asp Val Pro Phe Glu Ala Leu Gln
405 410 415

Asp Pro Tyr Gly Ile Thr Phe Trp Pro Asn Phe Lys Gly Arg Asp Gly
420 425 430

Cys Arg Thr Pro Met Pro Trp Ile Asp Ala Pro Leu Ala Gly Phe Thr
435 440 445

Ser Gly Glu Pro Trp Leu Pro Ile Pro Ala Glu His Arg Ala Ala Ala
450 455 460

Val Ala Val Gln Glu His Asp Pro His Ser Val Leu Asn Ala Phe Arg
465 470 475 480

Gln Phe Leu Ala Trp Arg Arg Thr Met Pro Thr Leu Leu Val Gly Asp
485 490 495

Ile Val Phe Leu Gln Thr Ala Glu Pro Val Leu Met Phe Glu Arg Arg
500 505 510

His Ala Gly Glu Thr Leu Leu Leu Ala Phe Asn Leu Ala Ala Asp Thr
515 520 525

Ala Arg Val Ala Leu Pro Ala Gly Ser Trp Gln Pro Met His Val Pro
530 535 540

Gly Pro Asp Val Gly Gln Ala Asp Gly Gly Thr Leu Val Leu Pro Ala
545 550 555 560

Gln Ser Met Tyr Cys Ala Arg Leu Gly
565



<210> 3 
<211> 1293 
<212> DNA 
<213> Unknown 

<220> 



<223> Obtained from an environmental sample 



<400> 3 

atgagtctgt ggcgtgcgct cgtcgcgttt gttctgctgg ccgtcgccat gcccgcattt 60
gccgacgtgg tcgcgaccgc ctcttcgcca ggcgatgtcc tcaaggtgga gatcaccacc 120
aacggcgagg gccgcatcgg ctatgcggtc acccggctcg gcaagccggt aatcggcgag 180
agccacctcg gattcctcct ggccgacgcg ccgcagctgc tgcgcaactt ccaggtcgtc 240
gatcaggcca cccggacctt cgacgaaacg tgggagcagc cgtgggggga gtggcgcacg 300
gtccgcaacc actacaacga gctcgcgatc accttcgagg agaagaccaa gctccatcgg 360
cggatgcgga tcgtttttcg cctgttcgat gaagggatcg gctttcgcta cgagcttccc 420
cggcagccga acctggcgca cgccaacatc gccgaggaac tgacccagtt caacgtcgcg 480
cgaccgggca cggcctggtg ggcaccggcg ttcgaatcca accgcgagga atatctctac 540
aaccagaccc cgatcgacgg tgtcgcgatt gcgatgactc cattcacgat gcggttcgag 600
gacgggactc acctcagcat ccacgaagcc gcgctggtcg actattccgg gatgaacgtc 660
acgcgtgtcc agggcacgaa cttcaaggcc atcctgacgc ccggttcgat gggccccaaa 720
gtctcccgcg ataccccatt cgagaccccg tggcgggtca tcctgatcag ccccgacgct 780
gcgcacctct acgaatcgaa caggctgatc ctcaacctca acgaacccaa caagctcggc 840
gacgtcagct gggtccaccc gcgcaaatat gtcggcatct ggtggggcat gcacctcgat 900
acccagagtt gggcctcggg gccgaagcac ggcgcgacca ccgcttatgc gaagcgaatg 960
atcgatttcg cggcgacgaa cggctttacc gggctgctcg tcgaaggctg gaacaaggga 1020
tgggacggag actggttcgc gaccggcgac gatttcagct tcaccgaacc ctatcccgat 1080
ttcgacatcc gagccgtcgc ggcctacagc ctcaggaaag gcgtccacct catcgggcat 1140
cacgaaacca gcggcaacat cgcccactac gagcaacagc tcgatgcggc gctcgatctc 1200
gaccgccagc tcggcatcga cacggtgaag acgggctacg tctcggatgc cggcggcatc 1260
caggcgctcg ggcccgacgg caggatccaa aga 1293
<210> 4
<211> 431
<212> PRT
<213> Unknown 



<220> 

<223> Obtained from an environmental sample 

<221> SIGNAL 
<222> (1)...(21)



<400> 4 

Met Ser Leu Trp Arg Ala Leu Val Ala Phe Val Leu Leu Ala Val Ala
1 5 10 15

Met Pro Ala Phe Ala Asp Val Val Ala Thr Ala Ser Ser Pro Gly Asp
20 25 30

Val Leu Lys Val Glu Ile Thr Thr Asn Gly Glu Gly Arg Ile Gly Tyr
35 40 45

Ala Val Thr Arg Leu Gly Lys Pro Val Ile Gly Glu Ser His Leu Gly
50 55 60

Phe Leu Leu Ala Asp Ala Pro Gln Leu Leu Arg Asn Phe Gln Val Val
65 70 75 80

Asp Gln Ala Thr Arg Thr Phe Asp Glu Thr Trp Glu Gln Pro Trp Gly
85 90 95

Glu Trp Arg Thr Val Arg Asn His Tyr Asn Glu Leu Ala Ile Thr Phe
100 105 110

Glu Glu Lys Thr Lys Leu His Arg Arg Met Arg Ile Val Phe Arg Leu
115 120 125

Phe Asp Glu Gly Ile Gly Phe Arg Tyr Glu Leu Pro Arg Gln Pro Asn
130 135 140

Leu Ala His Ala Asn Ile Ala Glu Glu Leu Thr Gln Phe Asn Val Ala
145 150 155 160

Arg Pro Gly Thr Ala Trp Trp Ala Pro Ala Phe Glu Ser Asn Arg Glu
165 170 175

Glu Tyr Leu Tyr Asn Gln Thr Pro Ile Asp Gly Val Ala Ile Ala Met
180 185 190

Thr Pro Phe Thr Met Arg Phe Glu Asp Gly Thr His Leu Ser Ile His
195 200 205

Glu Ala Ala Leu Val Asp Tyr Ser Gly Met Asn Val Thr Arg Val Gln
210 215 220

Gly Thr Asn Phe Lys Ala Ile Leu Thr Pro Gly Ser Met Gly Pro Lys
225 230 235 240

Val Ser Arg Asp Thr Pro Phe Glu Thr Pro Trp Arg Val Ile Leu Ile
245 250 255

Ser Pro Asp Ala Ala His Leu Tyr Glu Ser Asn Arg Leu Ile Leu Asn
260 265 270

Leu Asn Glu Pro Asn Lys Leu Gly Asp Val Ser Trp Val His Pro Arg
275 280 285

Lys Tyr Val Gly Ile Trp Trp Gly Met His Leu Asp Thr Gln Ser Trp
290 295 300

Ala Ser Gly Pro Lys His Gly Ala Thr Thr Ala Tyr Ala Lys Arg Met
305 310 315 320

Ile Asp Phe Ala Ala Thr Asn Gly Phe Thr Gly Leu Leu Val Glu Gly
325 330 335

Trp Asn Lys Gly Trp Asp Gly Asp Trp Phe Ala Thr Gly Asp Asp Phe
340 345 350

Ser Phe Thr Glu Pro Tyr Pro Asp Phe Asp Ile Arg Ala Val Ala Ala
355 360 365

Tyr Ser Leu Arg Lys Gly Val His Leu Ile Gly His His Glu Thr Ser
370 375 380

Gly Asn Ile Ala His Tyr Glu Gln Gln Leu Asp Ala Ala Leu Asp Leu
385 390 395 400

Asp Arg Gln Leu Gly Ile Asp Thr Val Lys Thr Gly Tyr Val Ser Asp
405 410 415

Ala Gly Gly Ile Gln Ala Leu Gly Pro Asp Gly Arg Ile Gln Arg
420 425 430

<210> 5
<211> 1773
<212> DNA
<213> Bacteria



<400> 5 



atgaccctca acaacaccca tgccgactgg tggaaacagg cggtggtcta ccaggtctac 60 
ccgcgcagct tcaaggatgt gaacggtgac ggtctgggcg alatcgccgg cgtgaccga"c 

^ccaaaS 3tCtCaa ^ a ^tgggcgtc gacgcgatct ggttgtcgcc g?tctacccg 180 

tccgagctgg cggacggcgg ctacgacgtc atcgactacc gcgacgtgga cccgcgcctg 240 

ggctccatgg acgacttcga cgccatggcc gcagccgcgc acgaggccgg catgaaggtg 300 

atcgtggaca tcgtgccgaa ccacacctcc gaccggcacg tctggtttga ggaggcgctg 360 

gcggcagaac cgggctcccc cgcgcgcgac cggtacatct tccgcgacgg rctgggcgaa 420 

cacggcgagc tgcccccgaa cgactggcag tcgatcttcg gcggcggcgc atgggaacgg 480 

gtgccggacg gccagtggta cctgcacatg ttcgccaagg aacagcccga cc?caac?gg 540 

aagaacccgg aggtccacga ggagttcaag aagaccctgc gtttctggtc ggaccatggc 600 

gcggacgggt tccgcatcga cgtggcgcac ggcctggcca aggacc?cga cagcgcgclg 660 

ctgaccgaac tggcgttgaa gggcaacccg ctggagggac tgtgccatga cggcacgaa? 720 

ccgctgtggg accgccccga ggtgcacgac atctaccgcg aatggcgcga gg?gttSac 780 

= cgccgcgctt cgccgtcggc gaggcctggg tgg?gcccga aca?cagcac 840 

ctgtacgcct ccgaggagga gctggggcag gtgttcaact tcgaattcgc caaggccgac 900 

aS^ CCg agga " tccg "ccgccatc cgcgagggtc tggaatcagc cgaggaa?cc 960 

ccacctgggt gatgagcaac catgacgtgg tccgccacgc cag?cggtac 1020 

Tattatttnt ag9 ^ gaagac ^agtggtac caccagctgg ccgacgcctg gctgcigcgc 1080 

agcggttcga acttcatcga ggaccgtgag ctcggcaccc gccgcgctcg cgccgcgatc 1140 

atgatggaac tcggtctgcc cggttccgtg tacgtgtacc agggcgagga gctgggcctg 1200 

SSKS* CggaCatCCC g^ggaccgt ctggaggacc cgaccccg^t ccgcl^cgal 1260 

cacggccgca ccgccaaggg tcgtgacggc tgccgtgtgc cgttgccgtg gtccgccgcc 1320 

9a ^ gCg r ga ^ CCC ^ C< 3 CC gtgggatgcg aagttcggca ccggtgcctc gttcggcttc 1380 

tccccgtcca tgcgacccga cggcgagccc gccgccgatc cgcacctgcc gcaaccattg 1440 

J?™ a ^ gg att l c * cc, * c cgatgtggag gactccgatc cggattcgat gctcaacctg 1500 

taccgcaggg ccatcgcgtt ccgccagcgg atgctcaccc cgaccggcga tacgtcgatc 1560 

cggtggatcg gggattccga gttcggcgac aaggccggcc aggtggiggc gta?tcgcgc 1620 

T^lTt^ aCgagggcg ^ cgaccgcttc gccagcatca cSac^tcgg cccgga^g iIsS 

gtcgagcttc cggccggtga gattgtcttt gcttcgggcc cgctggatgg cgtccggctg 1740 

cccgccgaca ccaccgtgtg gatgatggtc tga 1773



<210> 6 
<211> 590 
<212> PRT 
<213> Bacteria 



<220> 

<221> DOMAIN 
<222> (18)...(432)

<223> Alpha amylase, catalytic domain 
<400> 6 

Met Thr Leu Asn Asn Thr His Ala Asp Trp Trp Lys Gln Ala Val Val
1 5 10 15

Tyr Gln Val Tyr Pro Arg Ser Phe Lys Asp Val Asn Gly Asp Gly Leu
20 25 30

Gly Asp Ile Ala Gly Val Thr Asp Arg Ile Pro Tyr Leu Lys Glu Leu
35 40 45

Gly Val Asp Ala Ile Trp Leu Ser Pro Phe Tyr Pro Ser Glu Leu Ala
50 55 60

Asp Gly Gly Tyr Asp Val Ile Asp Tyr Arg Asp Val Asp Pro Arg Leu
65 70 75 80

Gly Ser Met Asp Asp Phe Asp Ala Met Ala Ala Ala Ala His Glu Ala
85 90 95

Gly Met Lys Val Ile Val Asp Ile Val Pro Asn His Thr Ser Asp Arg
100 105 110

His Val Trp Phe Glu Glu Ala Leu Ala Ala Glu Pro Gly Ser Pro Ala
115 120 125

Arg Asp Arg Tyr Ile Phe Arg Asp Gly Leu Gly Glu His Gly Glu Leu
130 135 140

Pro Pro Asn Asp Trp Gln Ser Ile Phe Gly Gly Gly Ala Trp Glu Arg
145 150 155 160

Val Pro Asp Gly Gln Trp Tyr Leu His Met Phe Ala Lys Glu Gln Pro
165 170 175

Asp Leu Asn Trp Lys Asn Pro Glu Val His Glu Glu Phe Lys Lys Thr
180 185 190

Leu Arg Phe Trp Ser Asp His Gly Ala Asp Gly Phe Arg Ile Asp Val
195 200 205

Ala His Gly Leu Ala Lys Asp Leu Asp Ser Ala Pro Leu Thr Glu Leu
210 215 220

Ala Leu Lys Gly Asn Pro Leu Glu Gly Leu Cys His Asp Gly Thr Asn
225 230 235 240

Pro Leu Trp Asp Arg Pro Glu Val His Asp Ile Tyr Arg Glu Trp Arg
245 250 255

Glu Val Phe Asn Glu Tyr Asp Pro Pro Arg Phe Ala Val Gly Glu Ala
260 265 270

Trp Val Val Pro Glu His Gln His Leu Tyr Ala Ser Glu Glu Glu Leu
275 280 285

Gly Gln Val Phe Asn Phe Glu Phe Ala Lys Ala Asp Trp Phe Ala Glu
290 295 300

Asp Phe Arg Thr Ala Ile Arg Glu Gly Leu Glu Ser Ala Glu Glu Ser
305 310 315 320

Gly Ser Thr Thr Thr Trp Val Met Ser Asn His Asp Val Val Arg His
325 330 335

Ala Ser Arg Tyr Gly Leu Pro Gln Val Lys Thr Gln Trp Tyr His Gln
340 345 350

Leu Ala Asp Ala Trp Leu Leu Arg Ser Gly Ser Asn Phe Ile Glu Asp
355 360 365

Arg Glu Leu Gly Thr Arg Arg Ala Arg Ala Ala Ile Met Met Glu Leu
370 375 380

Gly Leu Pro Gly Ser Val Tyr Val Tyr Gln Gly Glu Glu Leu Gly Leu
385 390 395 400

Phe Glu Val Ala Asp Ile Pro Trp Asp Arg Leu Glu Asp Pro Thr Pro
405 410 415

Phe Arg Thr Asp His Gly Arg Thr Ala Lys Gly Arg Asp Gly Cys Arg
420 425 430

Val Pro Leu Pro Trp Ser Ala Ala Asp Ala Pro Thr Pro Ala Pro Trp
435 440 445

Asp Ala Lys Phe Gly Thr Gly Ala Ser Phe Gly Phe Ser Pro Ser Met
450 455 460

Arg Pro Asp Gly Glu Pro Ala Ala Asp Pro His Leu Pro Gln Pro Leu
465 470 475 480

Trp Tyr Lys Asp Phe Ala Ala Asp Val Glu Asp Ser Asp Pro Asp Ser
485 490 495

Met Leu Asn Leu Tyr Arg Arg Ala Ile Ala Phe Arg Gln Arg Met Leu
500 505 510

Thr Pro Thr Gly Asp Thr Ser Ile Arg Trp Ile Gly Asp Ser Glu Phe
515 520 525

Gly Asp Lys Ala Gly Gln Val Val Ala Tyr Ser Arg Val Ser Thr Asp
530 535 540

Glu Gly Gly Asp Arg Phe Ala Ser Ile Thr Asn Phe Gly Pro Asp Ser
545 550 555 560

Val Glu Leu Pro Ala Gly Glu Ile Val Phe Ala Ser Gly Pro Leu Asp
565 570 575

Gly Val Arg Leu Pro Ala Asp Thr Thr Val Trp Met Met Val
580 585 590

<210> 7
<211> 2187
<212> DNA
<213> Bacteria



<400> 7 

atggtcggtg tcgccgcact cgacgacacc gtgctacgcg tagcctactg caggtcccct 60 

ggggagtggc cgacgagcac acctgccatt gtggaacaga tgtcgcagcg gcactcctgg 120 

cggctcgttc aggaagagcg acgcgtgcaa ctggaatgtg ttgccggatg gcagattcaa 180 

atcaaccggg acgacggaac atggagtata cgccatctcg gctttgggac agccgtcgag 240 

gccatcacgt ggtacaagcg caaaaaaggc ggcgcattga cgtttgcttc gctcgacaat 300 

gcccggttct atggacttgg cgaaaagccg gggccccttg acaagcggca cgaggcgtac 360 

accatgtgga attcggacgt ttacgcgcct cacgtgccag agatggaggc ccta?acctg 420 

tcgatcccgt tctttttgcg attgcaagac caaacggccg ttggStSt tgtggataa? 480 

ccagggcgat cccgctttga cttcaggagc agataccctg acgtggaaat atcgacagaa 540 

cgaggcgggt tggacgttta tttcatcttc ggggcttcgc tcaaagatgt cattcgacgt 600 

l^^ttlt tc t C<3g9gcg aatgccgatg ccacccaagt gggcgctcgg ctatcatcag 660 

tctcggtact cgtacgagac ccagagcgag gtcctgtcgg tggcgcaaac gtttgtcgag 720 

cgcgacattc cggtggatgc cttgtatctg gatattcact acatggatgg atatcgggtg 780 

ttcacctttg atgagaggcg atttcccgat ccggcacgca tgtgcgacga gttgcggaag 840 

cttggggtgc gcgttgtccc catcgtggat ccgggggtga aacaggatcc rgaatltccc 900 

gtttacatgg acgggctagc ccacaaccac ttctgccaaa ccgccgaggg tcaagtgtat 960 

cttggtgagg tatggcccgg attgagtgcc tttcctgatt tcgcgtcgga ggaagtgcgg 1020 

gcgtggtggg gcaagtggca tcgggtgtac acccagatgg gaattgaggg catatgqaat 1080 

gatatgaacg agccggctgt gttcaatgag acgaagacca ?ggacgtgll cg£gg??cac 1140

cgaggcgacg gacgattgta cacgcacgga gaggtccaca acctctatgg attctggatg 1200 

gctgaagcca cgtatcgcgg actgaaagcg cagttggctg gcaagcggcc ttttg?gctc 1260 

acgcgcgctg gctacagcgg aattcaacgg tatgcggcgg tgtggacggg cgacaaccga 1320 

agcttttggg aacatatggc catggccatc ccgatggtgc tcaacatggg gltgtccgg? 1380 

attccgctcg ggggcccaga tgtgggcggt ttcgctcatc acgcttcagg tgaattgc?t 1440 

gcacgatgga cgcaaatggg ggcgttcttt ccgttctttc gaaatcacag tgccatggga 1500 

acgcatcgac aggaaccttg ggcattcggc cctacgttcg aagccgtcat tcgacgtgcc 1560 

atccggctcc gttatcgttt cctgccttat ctgtacacgc tcgcacggga agcccataaa 1620

acaggtttac ccatgatgcg gcccttggtg ttagagta?c ccgacg^cc aEcgcac 1680

cacgtcgatg atcagttcct cgtcggttcc gatcttctcg tggcacccat cc?caagccg 1740

ggcatggccc atagaatggt ctatttgccc gatggagaat ggattgatta tgagacgcg? 1800

gagcgatacc aagggcgtca gtacatcttg acgtatgcgc ctctcgatcg tatccc?c?g 1860

tacgtgaggg cagggagcgc tatccccgtg aacctcttgg agcggtcggg cgagacccaa 1920

ctaggatggg agatcttcgt ggacgccaac ggtcgagc?? cggg?cgl?g c?a?gaggac 1980

gacggcgaga cgtttagcta tgaagacggc gcctactgtg a?cgcg?gt? acaggccSc 2040

gccacctcgg aaggaacact gatcgaatgt catcttgttc aaggatcagg aga?gg?gg a 2100

t^cc^cca 9 Sg^gr" °— ^ "^-^ £2 

' ^ 2187 



<210> 8 
<211> 728
<212> PRT 
<213> Bacteria 



<220> 

<221> DOMAIN 
<222> (58)...(717)

<223> Glycosyl hydrolases family 31 
<400> 8 

Met Val Gly Val Ala Ala Leu Asp Asp Thr Val Leu Arg Val Ala Tyr
1 5 10 15

Cys Arg Ser Pro Gly Glu Trp Pro Thr Ser Thr Pro Ala Ile Val Glu
20 25 30

Gln Met Ser Gln Arg His Ser Trp Arg Leu Val Gln Glu Glu Arg Arg
35 40 45

Val Gln Leu Glu Cys Val Ala Gly Trp Gln Ile Gln Ile Asn Arg Asp
50 55 60

Asp Gly Thr Trp Ser Ile Arg His Leu Gly Phe Gly Thr Ala Val Glu
65 70 75 80

Ala Ile Thr Trp Tyr Lys Arg Lys Lys Gly Gly Ala Leu Thr Phe Ala
85 90 95

Ser Leu Asp Asn Ala Arg Phe Tyr Gly Leu Gly Glu Lys Pro Gly Pro
100 105 110

Leu Asp Lys Arg His Glu Ala Tyr Thr Met Trp Asn Ser Asp Val Tyr
115 120 125

Ala Pro His Val Pro Glu Met Glu Ala Leu Tyr Leu Ser Ile Pro Phe
130 135 140

Phe Leu Arg Leu Gln Asp Gln Thr Ala Val Gly Ile Phe Val Asp Asn
145 150 155 160

Pro Gly Arg Ser Arg Phe Asp Phe Arg Ser Arg Tyr Pro Asp Val Glu
165 170 175

Ile Ser Thr Glu Arg Gly Gly Leu Asp Val Tyr Phe Ile Phe Gly Ala
180 185 190

Ser Leu Lys Asp Val Ile Arg Arg Tyr Thr Lys Leu Thr Gly Arg Met
195 200 205

Pro Met Pro Pro Lys Trp Ala Leu Gly Tyr His Gln Ser Arg Tyr Ser
210 215 220

Tyr Glu Thr Gln Ser Glu Val Leu Ser Val Ala Gln Thr Phe Val Glu
225 230 235 240

Arg Asp Ile Pro Val Asp Ala Leu Tyr Leu Asp Ile His Tyr Met Asp
245 250 255

Gly Tyr Arg Val Phe Thr Phe Asp Glu Arg Arg Phe Pro Asp Pro Ala
260 265 270

Arg Met Cys Asp Glu Leu Arg Lys Leu Gly Val Arg Val Val Pro Ile
275 280 285

290 01 y 295 Pr ° MSt ASP 

Gly Leu Ala His Asn His Phe Cys Gln Thr Ala Glu Gly Gln Val Tyr
305 310 315 320

Leu Gly Glu Val Trp Pro Gly Leu Ser Ala Phe Pro Asp Phe Ala Ser
325 330 335

Glu Glu Val Arg Ala Trp Trp Gly Lys Trp His Arg Val Tyr Thr Gln
340 345 350

Met Gly Ile Glu Gly Ile Trp Asn Asp Met Asn Glu Pro Ala Val Phe
355 360 365

Asn Glu Thr Lys Thr Met Asp Val Asn Val Val His Arg Gly Asp Gly
370 375 380

Arg Leu Tyr Thr His Gly Glu Val His Asn Leu Tyr Gly Phe Trp Met






WO 2004/085615 



PCT/US2004/008541 



385 390 395 400

Ala Glu Ala Thr Tyr Arg Gly Leu Lys Ala Gln Leu Ala Gly Lys Arg
405 410 415

Pro Phe Val Leu Thr Arg Ala Gly Tyr Ser Gly Ile Gln Arg Tyr Ala
420 425 430

Ala Val Trp Thr Gly Asp Asn Arg Ser Phe Trp Glu His Met Ala Met
435 440 445

Ala Ile Pro Met Val Leu Asn Met Gly Met Ser Gly Ile Pro Leu Gly
450 455 460

Gly Pro Asp Val Gly Gly Phe Ala His His Ala Ser Gly Glu Leu Leu
465 470 475 480

Ala Arg Trp Thr Gln Met Gly Ala Phe Phe Pro Phe Phe Arg Asn His
485 490 495

Ser Ala Met Gly Thr His Arg Gln Glu Pro Trp Ala Phe Gly Pro Thr
500 505 510

Phe Glu Ala Val Ile Arg Arg Ala Ile Arg Leu Arg Tyr Arg Phe Leu
515 520 525

Pro Tyr Leu Tyr Thr Leu Ala Arg Glu Ala His Glu Thr Gly Leu Pro
530 535 540

Met Met Arg Pro Leu Val Leu Glu Tyr Pro Asp Asp Pro Asn Thr His
545 550 555 560

His Val Asp Asp Gln Phe Leu Val Gly Ser Asp Leu Leu Val Ala Pro
565 570 575

Ile Leu Lys Pro Gly Met Ala His Arg Met Val Tyr Leu Pro Asp Gly
580 585 590

Glu Trp Ile Asp Tyr Glu Thr Arg Glu Arg Tyr Gln Gly Arg Gln Tyr
595 600 605

Ile Leu Thr Tyr Ala Pro Leu Asp Arg Ile Pro Leu Tyr Val Arg Ala
610 615 620

Gly Ser Ala Ile Pro Val Asn Leu Leu Glu Arg Ser Gly Glu Thr Gln
625 630 635 640

Leu Gly Trp Glu Ile Phe Val Asp Ala Asn Gly Arg Ala Ser Gly Arg
645 650 655

Cys Tyr Glu Asp Asp Gly Glu Thr Phe Ser Tyr Glu Asp Gly Ala Tyr
660 665 670

Cys Asp Arg Val Leu Gln Ala Leu Ala Thr Ser Glu Gly Thr Leu Ile
675 680 685

Glu Cys His Leu Val Gln Gly Ser Gly Asp Gly Gly Ser Leu Glu Ser
690 695 700

Val Val Arg Val Phe Thr Pro Asp Asp Val Arg Glu Ala Arg Ala Gln
705 710 715 720

Gly Ile Ser Phe Ser Ile His Val
725

<210> 9 
<211> 1611 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 9 

atgacagagt ggtggcgtgg tgcagtgacc tatcaagtct atccaaggtc gtttcaggac 60 

agcaacggcg acggcatcgg cgacctgccc ggcatcaccg cccggcttga gtatctggcc 120 

gatcttggcg tggacgcggt ctggctgtca ccgttcttca aaagcccgat gaaggacatg 180 

ggctatgacg tcagcgacta ttgcgatgtc gatccggtct tcggcaccct cgccgatttt 240

gacgccctgc tggcccgcgc gcatgagctg gggctcaagg tgatcatcga ccaggtcctt 300 

agccacagtt ccgacctgca ccctgccttt gtgaccagtc gcagcgaccg cgtgaacccg 360 
aaggcggact ggtatgtctg ggccgatccc aagcccgacg gcagcccgcc caacaactgg 420
ctgtcggtgt tcggtggctc ggcatgggcc tgggacgcgc gcagaaaaca gtattacctg 480
cacaatttcc tgaccagcca gccggacctg aactaccaca acccgaaggt gcaggactgg 540
gcgctggaca acatgcgttt ctggctggac cggggcgtgg acgggttccg ctttgacacc 600
gtcaactact tcttccacga tcccttgttg cgcagcaacc ctgccgatca ccgcaacaag 660
cctgaggctg acggcaatcc ctacggcatg cagtaccacc tgcatgacaa gaaccagccc 720
gagaacctga tctggatgga gcggatacgg gtgcttctgg accaatacgg tgccgcaagc 780
Utility tgggcgaaag tcacc ^oc atccggatga tgggcgacta caccgctccg 840
gggcggctgc atcaatgcta cagctttgaa ttcatggggt atgaatacac cgcaaacctg 900
ttccgggacc ggatagaaag ctttttcaag ggtgccccta aaggctggcc gatgtgggcg 960
llaattnttn acgatgtcgt ccgccatgtc agtcgctggg caaaacatgg cctcacccc? 1020
tlltJ.nl 9 Ccaagcagac ^gg^cgttg cttctgtcgc ttgagggctc gatctgcctg 1080
tgggagggcg aggagctggg ccagaccgat accgaactgg ccttggatga gttgaccgat 1140
ccgcagggca tcgtcttttg gcccgaaccg atcggccgcg acaatactcg gacgccaa?g 1200
gtttgggacg catcgccgca tggcgggttt tcgaccgtca caccctggct gccggtgaaa 1260
ccggaacagg ccgcgcgtca tgtggccggg caaaccggtg atgccgcctc ggtgctggaa 1320
cgatgctggc cttccggcgc gctgaaccgg cccttaggac Igggcggacg 1380
cggtttctgg atctggccga accggttctg ggctttgtgc gcggcgaagg ggagggtgcg 1440
atcctgtgcc tgttcaatct gtcgcctgtt gcgcgggggg ttgcggtcga aggcg?gggc 1500
aacoocScca llttll^ gc ? ggctatc ctttcgggcg gacggctagg ccftggcccg 1560
aacggcgccg ccttcctgcg ggtgaccgga acagtccgcg ttctggacta a 1611

<210> 10
<211> 536
<212> PRT
<213> Unknown


<220> 

<223> Obtained from an environmental sample 

<221> DOMAIN 
<222> (12)...(449)

<223> Alpha amylase, catalytic domain 
<400> 10 

Met Thr Glu Trp Trp Arg Gly Ala Val Thr Tyr Gln Val Tyr Pro Arg
1 5 10 15

Ser Phe Gln Asp Ser Asn Gly Asp Gly Ile Gly Asp Leu Pro Gly Ile
20 25 30

Thr Ala Arg Leu Glu Tyr Leu Ala Asp Leu Gly Val Asp Ala Val Trp
35 40 45

Leu Ser Pro Phe Phe Lys Ser Pro Met Lys Asp Met Gly Tyr Asp Val
50 55 60

Ser Asp Tyr Cys Asp Val Asp Pro Val Phe Gly Thr Leu Ala Asp Phe
65 70 75 80

Asp Ala Leu Leu Ala Arg Ala His Glu Leu Gly Leu Lys Val Ile Ile
85 90 95

Asp Gln Val Leu Ser His Ser Ser Asp Leu His Pro Ala Phe Val Thr
100 105 110

Ser Arg Ser Asp Arg Val Asn Pro Lys Ala Asp Trp Tyr Val Trp Ala
115 120 125

Asp Pro Lys Pro Asp Gly Ser Pro Pro Asn Asn Trp Leu Ser Val Phe
130 135 140

Gly Gly Ser Ala Trp Ala Trp Asp Ala Arg Arg Lys Gln Tyr Tyr Leu
145 150 155 160

His Asn Phe Leu Thr Ser Gln Pro Asp Leu Asn Tyr His Asn Pro Lys
165 170 175

Val Gln Asp Trp Ala Leu Asp Asn Met Arg Phe Trp Leu Asp Arg Gly
180 185 190

Val Asp Gly Phe Arg Phe Asp Thr Val Asn Tyr Phe Phe His Asp Pro
195 200 205

Leu Leu Arg Ser Asn Pro Ala Asp His Arg Asn Lys Pro Glu Ala Asp
210 215 220

Gly Asn Pro Tyr Gly Met Gln Tyr His Leu His Asp Lys Asn Gln Pro
225 230 235 240

Glu Asn Leu Ile Trp Met Glu Arg Ile Arg Val Leu Leu Asp Gln Tyr
245 250 255

Gly Ala Ala Ser Val Gly Glu Met Gly Glu Ser His His Ala Ile Arg
260 265 270

Met Met Gly Asp Tyr Thr Ala Pro Gly Arg Leu His Gln Cys Tyr Ser
275 280 285

Phe Glu Phe Met Gly Tyr Glu Tyr Thr Ala Asn Leu Phe Arg Asp Arg
290 295 300

Ile Glu Ser Phe Phe Lys Gly Ala Pro Lys Gly Trp Pro Met Trp Ala
305 310 315 320

Phe Ser Asn His Asp Val Val Arg His Val Ser Arg Trp Ala Lys His
325 330 335

Gly Leu Thr Pro Glu Ala Val Ala Lys Gln Thr Gly Ala Leu Leu Leu
340 345 350

Ser Leu Glu Gly Ser Ile Cys Leu Trp Glu Gly Glu Glu Leu Gly Gln
355 360 365

Thr Asp Thr Glu Leu Ala Leu Asp Glu Leu Thr Asp Pro Gln Gly Ile
370 375 380

Val Phe Trp Pro Glu Pro Ile Gly Arg Asp Asn Thr Arg Thr Pro Met
385 390 395 400

Val Trp Asp Ala Ser Pro His Gly Gly Phe Ser Thr Val Thr Pro Trp
405 410 415

Leu Pro Val Lys Pro Glu Gln Ala Ala Arg His Val Ala Gly Gln Thr
420 425 430

Gly Asp Ala Ala Ser Val Leu Glu Ser Tyr Arg Ala Met Leu Ala Phe
435 440 445

Arg Arg Ala Glu Pro Ala Leu Arg Thr Gly Arg Thr Arg Phe Leu Asp
450 455 460

Leu Ala Glu Pro Val Leu Gly Phe Val Arg Gly Glu Gly Glu Gly Ala
465 470 475 480

Ile Leu Cys Leu Phe Asn Leu Ser Pro Val Ala Arg Gly Val Ala Val
485 490 495

Glu Gly Val Gly Pro Pro Ile Gly Pro Gly Gln Gln Ala Ile Leu Ser
500 505 510

Gly Gly Arg Leu Gly Leu Gly Pro Asn Gly Ala Ala Phe Leu Arg Val
515 520 525

Thr Gly Thr Val Arg Val Leu Asp
530 535

<210> 11 
<211> 1719
<212> DNA
<213> Unknown

<220>

<223> Obtained from an environmental sample 
<400> 11 

gtgcccgaac tgaaatggtg gcagacagct atattttatc aaatctaccc gcgctctttc 60 
gccgacggca acggcgatgg catcggcgat ttcaaaggca tcatcggcaa actcqatta? 120 
ctacaaaatc ttggcataga tgcgctctgg ctctcgcctc acttcc^c?c ccccaactgg 180
gattgcggct acgatatcag cgattaccgc aacgttgcgc cggaatacgg cacgc^gqfc 240
gatttcaaaa ccttcctgag cgaatcgcac aaacgcgg?a tSgcgtc!? tct?ga???c 300
gtgctgaatc acacctccga tgaacatccg tggttcatcg aatcgaaatc cagccgcgat 360
aatcccaaat ccgattggta tgtgtgggtc gatacgccgc ccaacaattg gcagtcctgc 420
ttcgatggcg atgcctggac atacgtccct gaacgcggcc aatattatta tcactacttc 480
atgaaacagc agcccgatct caactggcat aatccgcagg tcaaacaggc catgtgggag 540
gcggtgcgct tctggctcga tctcggcgtg gacggcttcc gcctggacgc catcggcacg 600
atctacgaag acccaaatct cacgccgcat aatgtcccga tgaatttggc tgagctgcgt 660
cacttcacag atgtcgccaa aacgccggaa gagatcaagc tcaaagaaaa atactggcac 720
gacatgttca agcatcaatg gggtcagccc ggcgttcatg acctgatgaa agaactgcgc 780
gccatcctcg atgaatatga tggcgaccgc atgctggtcg gcgaagatga caacatcgat 840
tacatgggca acggagacga cgaattgcag ctggtcttca acttcccgtt gatgcgcgcc 900
gatcgtctca cccccgacca tattcggcgc aaccaaaaag agcgtttgac tcgtctgaat 960
gctttacccg ttaaaggctg ggcttgcaac acgctcggca accatgatag ttcacgcgtc 1020
tacaccaaat tcggtgaccg gatccacggc gcggaccatg cacgtctcaa cctggcgctt 1080
^S^^ C tgca ^gcac gccgttctta tacaacggcg aagagatcgg catgaccgac 1140
""^ ccgatcccac caaactgcgc gacaccatgg caacctggta ttacaacagc 1200
cttgtcaacg aaatgaaggt cgagccagcg gaggccgccc ttcgcgccgg acagatgacg 1260
cgcgacaaaa accgtacccc catgcaatgg gacaataagc ccaatgccgg tttttgccca 1320
gataaagccg aaccctggtt gccagtcaac cccaattacc gcgcaggcat taacgtccgc 1380
gagcaaacat cgaacccgaa ctcgctgctc aattactata aacgtctcat ccacttgcgg 1440
cgggaaacgc ctgccttgat cgctggagat tacgttccgc ttcaccagac atccaaagat 1500
catctggcct tcctgcgcaa aacagattca caaacgatcc tggtcgtttt gaattactcc 1560
^° a tltt a t tgg aattgga tttctcgcgc accgtcgaaa tgaaaggccg cccgctgatc 1620
gcaattttct ccagcgcaga tgaccgcccg caggcggcac aaagcccaaa gaaagtatcg 1680
gtcggcgctt acggagttct gctggcagaa gtaaaatag 1719

<210> 12
<211> 572
<212> PRT
<213> Unknown


<220> 

<223> Obtained from an environmental sample 

<221> DOMAIN 
<222> (14)...(480)

<223> Alpha amylase, catalytic domain 
<400> 12 

Met Pro Glu Leu Lys Trp Trp Gln Thr Ala Ile Phe Tyr Gln Ile Tyr
1 5 10 15

Pro Arg Ser Phe Ala Asp Gly Asn Gly Asp Gly Ile Gly Asp Phe Lys
20 25 30

Gly Ile Ile Gly Lys Leu Asp Tyr Leu Gln Asn Leu Gly Ile Asp Ala
35 40 45

Leu Trp Leu Ser Pro His Phe Pro Ser Pro Asn Trp Asp Cys Gly Tyr
50 55 60

Asp Ile Ser Asp Tyr Arg Asn Val Ala Pro Glu Tyr Gly Thr Leu Asp
65 70 75 80

Asp Phe Lys Thr Phe Leu Ser Glu Ser His Lys Arg Gly Ile Arg Val
85 90 95

Ile Leu Asp Leu Val Leu Asn His Thr Ser Asp Glu His Pro Trp Phe
100 105 110

Ile Glu Ser Lys Ser Ser Arg Asp Asn Pro Lys Ser Asp Trp Tyr Val
115 120 125

Trp Val Asp Thr Pro Pro Asn Asn Trp Gln Ser Cys Phe Asp Gly Asp
130 135 140

Ala Trp Thr Tyr Val Pro Glu Arg Gly Gln Tyr Tyr Tyr His Tyr Phe
145 150 155 160

Met Lys Gln Gln Pro Asp Leu Asn Trp His Asn Pro Gln Val Lys Gln
165 170 175

Ala Met Trp Glu Ala Val Arg Phe Trp Leu Asp Leu Gly Val Asp Gly
180 185 190

Phe Arg Leu Asp Ala Ile Gly Thr Ile Tyr Glu Asp Pro Asn Leu Thr
195 200 205

Pro His Asn Val Pro Met Asn Leu Ala Glu Leu Arg His Phe Thr Asp
210 215 220

Val Ala Lys Thr Pro Glu Glu Ile Lys Leu Lys Glu Lys Tyr Trp His
225 230 235 240

Asp Met Phe Lys His Gln Trp Gly Gln Pro Gly Val His Asp Leu Met
245 250 255

Lys Glu Leu Arg Ala Ile Leu Asp Glu Tyr Asp Gly Asp Arg Met Leu
260 265 270

Val Gly Glu Asp Asp Asn Ile Asp Tyr Met Gly Asn Gly Asp Asp Glu
275 280 285

Leu Gln Leu Val Phe Asn Phe Pro Leu Met Arg Ala Asp Arg Leu Thr
290 295 300

Pro Asp His Ile Arg Arg Asn Gln Lys Glu Arg Leu Thr Arg Leu Asn
305 310 315 320

Ala Leu Pro Val Lys Gly Trp Ala Cys Asn Thr Leu Gly Asn His Asp
325 330 335

Ser Ser Arg Val Tyr Thr Lys Phe Gly Asp Arg Ile His Gly Ala Asp
340 345 350

His Ala Arg Leu Asn Leu Ala Leu Leu Leu Thr Leu His Gly Thr Pro
355 360 365

Phe Leu Tyr Asn Gly Glu Glu Ile Gly Met Thr Asp His Ile Ile Thr
370 375 380

Asp Pro Thr Lys Leu Arg Asp Thr Met Ala Thr Trp Tyr Tyr Asn Ser
385 390 395 400

Leu Val Asn Glu Met Lys Val Glu Pro Ala Glu Ala Ala Leu Arg Ala
405 410 415

Gly Gln Met Thr Arg Asp Lys Asn Arg Thr Pro Met Gln Trp Asp Asn
420 425 430

Lys Pro Asn Ala Gly Phe Cys Pro Asp Lys Ala Glu Pro Trp Leu Pro
435 440 445

Val Asn Pro Asn Tyr Arg Ala Gly Ile Asn Val Arg Glu Gln Thr Ser
450 455 460

Asn Pro Asn Ser Leu Leu Asn Tyr Tyr Lys Arg Leu Ile His Leu Arg
465 470 475 480

Arg Glu Thr Pro Ala Leu Ile Ala Gly Asp Tyr Val Pro Leu His Gln
485 490 495

Thr Ser Lys Asp His Leu Ala Phe Leu Arg Lys Thr Asp Ser Gln Thr
500 505 510

Ile Leu Val Val Leu Asn Tyr Ser Pro Asn Lys Leu Glu Leu Asp Phe
515 520 525

Ser Arg Thr Val Glu Met Lys Gly Arg Pro Leu Ile Ala Ile Phe Ser
530 535 540

Ser Ala Asp Asp Arg Pro Gln Ala Ala Gln Ser Pro Lys Lys Val Ser
545 550 555 560

Val Gly Ala Tyr Gly Val Leu Leu Ala Glu Val Lys
565 570

<210> 13 
<211> 1392 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 13



<210> 14 
<211> 463 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<221> DOMAIN 

<222> (21)...(446)

<223> Family 4 glycosyl hydrolase 

<400> 14 

Met Glu Glu Glu Pro Arg Gly Lys Gly Leu Lys Ile Ala Phe Ile Gly

1 5 10 15

Ala Gly Ser Ala Val Trp Ser Ser Arg Ile Ile Val Asp Leu Ile Leu

20 25 30

Ala Lys Ser Leu His Gly Ala Lys Ile Tyr Leu Met Asp Ile Asn Lys

35 40 45

Glu Arg Leu Asp Leu Ile Thr Gly Phe Ala Lys Arg Tyr Ala Ala Glu

50 55 60

Met His Ala Asp Leu Glu Phe Ile Pro Thr Met Asp Arg Val Glu Ala

65 70 75 80

Ile Arg Asp Ala Asp Phe Val Val Asn Ser Ala Met Tyr Gly Gly His

85 90 95

Met Tyr Tyr Glu Arg Met Arg Glu Ile Cys Glu Arg His Gly Tyr Tyr

100 105 110

Arg Gly Ile Asn Ser Val Glu Trp Asn Met Val Ser Asp Tyr His Thr

115 120 125

Ile Trp Gly Tyr Tyr Gln Phe Lys Leu Ala Leu Ser Ile Ala Lys Asp

130 135 140

Val Glu Asp Tyr Ala Pro Asp Ala Trp Leu Ile Asn Val Ala Asn Pro




atggaagaag agcctagggg taaaggccta aagatcgcat ttataggtgc tgggagcgcc 60 

gtatggagct ctaggataat tgtagacctt atcctcgcta aaagcc?gca tggcgccSg 120 

atttacctca tggatatcaa taaggagcgc ctcgacctca taacagg?tt cgccaagcga 180 

tacgctgccg agatgcacgc tgatctagag ttcatcccga caatggatcg tgtcgaag?g 240 

SSSSS ccgacttcgt cgtcaactca gccatgtatg gtgglcaca? g?ac?atgjg 300 

cggatgaggg aaatatgtga gagacacggc tattacaggg gaataaatag tgtcgagtgg 360 

;;?!!?f Ca gcgactacca caccatatgg ggctactacc agttcaaac? agcactaagc 420 

? Cgtcgagga "acgctccc gatgcatggc ttatcaacgt cgctaatcca 480 

gtgttcgaat tgacaacact gatccagagg cacgtcaaga taaagatgat agggctgtgc 540 

catggatacc acggcatata taatgtcatg aaagaactag gcctlccacg aglagaSa 600 

9 ?? tacta ^ att caaccacgtc atatggctta caaagttcaa gtaccaggga 660 

gaagacgctt acccgttact agacaagtgg atcgaggaga aagcagagaa gtactggga^ 720 

cattggagac aaacacaggt aaacccgttc gacatcgact tgtcgccggc agcga?agac 780 

atgtacaaga gatacggtct cctcccggta ggagacactg tgcg?gglgg gKgtggKg 840 

taccactggg atctcaagac gaagcagaaa tggtatggcc cgacaggagg accaglctcc 90S 

gagataggct ggatgatgta tatagccttc ctaagcatgc agctccaaag acta?acgaa 960 

?^!** 9g atc ***** c * cccgttagca gcacatgtac cgccggagtl gagcggtgJa 1020 

acaSalS Sf^ ^<=<=^cc aacaatagga ggggagaata cgt^taLLc lolo 

t*ll Ca9Ca t accc gg3ata ccggataatg tggctgtcga gatgccggcc 1140 

cagatagatg gtaaaggagt gcaccgctac atattcgagc cactccccaa gaagataaga 1200 

gacctggtcc tactgcctag gatgacccgt atggagatgg cattgacagc cttcctcqaa 1260 

ggaggccgtg aagtactaga ggactggcta cacatgga?? cacg?accaa gagcStaga llll 

ca? g ccag? g ^ aCaatCga t»«ctcctt aacatgcccg gtaacgagga gargaagaag lllo 
145 150 155 160

Val Phe Glu Leu Thr Thr Leu Ile Gln Arg His Val Lys Ile Lys Met

165 170 175

Ile Gly Leu Cys His Gly Tyr His Gly Ile Tyr Asn Val Met Lys Glu

180 185 190

Leu Gly Leu Pro Arg Glu Glu Thr Glu Phe Glu Val Leu Gly Phe Asn

195 200 205

His Val Ile Trp Leu Thr Lys Phe Lys Tyr Gln Gly Glu Asp Ala Tyr

210 215 220

Pro Leu Leu Asp Lys Trp Ile Glu Glu Lys Ala Glu Lys Tyr Trp Glu

225 230 235 240

His Trp Arg Gln Thr Gln Val Asn Pro Phe Asp Ile Asp Leu Ser Pro

245 250 255

Ala Ala Ile Asp Met Tyr Lys Arg Tyr Gly Leu Leu Pro Val Gly Asp

260 265 270

Thr Val Arg Gly Gly Thr Trp Met Tyr His Trp Asp Leu Lys Thr Lys

275 280 285

Gln Lys Trp Tyr Gly Pro Thr Gly Gly Pro Asp Ser Glu Ile Gly Trp

290 295 300

Met Met Tyr Ile Ala Phe Leu Ser Met Gln Leu Gln Arg Leu Tyr Glu

305 310 315 320

Ala Leu Thr Asp Gln Lys His Pro Leu Ala Ala His Val Pro Pro Glu

325 330 335

Trp Ser Gly Glu Ser Ile Val Pro Ile Ile Asp Ser Leu Ala Asn Asn

340 345 350

Arg Arg Gly Glu Tyr Val Ile Asn Thr Leu Asn Leu Gly Ser Ile Pro

355 360 365

Gly Ile Pro Asp Asn Val Ala Val Glu Met Pro Ala Gln Ile Asp Gly

370 375 380

Lys Gly Val His Arg Tyr Ile Phe Glu Pro Leu Pro Lys Lys Ile Arg

385 390 395 400

Asp Leu Val Leu Leu Pro Arg Met Thr Arg Met Glu Met Ala Leu Thr

405 410 415

Ala Phe Leu Glu Gly Gly Arg Glu Val Leu Glu Asp Trp Leu His Met

420 425 430

Asp Pro Arg Thr Lys Ser Thr Arg Gln Val Arg Glu Thr Ile Asp Asp

435 440 445

Leu Leu Asn Met Pro Gly Asn Glu Glu Met Lys Lys His Phe Ser

450 455 460



<210> 15 
<211> 1632 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 15 



ill III iii ssp ~~ sss £ 

yy*Lucigcc cgnttttcac cagcccgatg gccgattatg gctatgacqt aacaoattaf oac\ 

W SSSS SSSS 2SS2S " 
tggctggagc ggggggtgga cgggttccgg ctcgatgcgc tgaaccactc gatgttcgat 660

ccggcgctga ccgacaatct gcccgcgccc gaggatggca agatccgcac gcggcct?rc 720

ItcalTaTa tit**** 0 ** Cagccagaac catcccgctg tgacgc?gtt ca?cgagcgg 780

atcgccgatg tttgcggcca gcatggcgcg gtcttcaccg tggccgaagt gggcggcgl? 840

ggcgcggttc cgctgatgaa ggcgtacacg gcgggcgagc atcggctgtc ctcggcc?ac 900

agctttgatt tcctttatgc gccggccttg acgggcgagc tgg?ggccaa tgclctggcc 960

cagtggacgg gcaagccggg ggccgatggg ctgagcgaag gctggcccag c?ggg C g?« 1020

gagaaccatg atgcgccgcg ccatatctcg cgctgggtgg gcgaggagca tcgcgccgcc 1080

aalllanlT J gagcC "^ ^tgctggcc tcgctgcgcg gtaacatgtt ca?gtltcag 1140

gggcaggaac tggcgctgga gcaggatgag atcccgtttc atctgctgaa agaccccaaa 1200

gcgatcgcca actggccgct gacgttgagc cgcgacgggg tgcgcacgcc gatgccatgg 1260

Taa^lf 39 CCttCCatgc cggcttcacg agtggcgagc cctggttgcc cttgtcgccg 1320

gggaatatcg ccaaggcggt ggatgtgcag gaggccgatc cgcagagcca gctgcactgg 1380

ca?a?aS^ " ct ^ ccct gcgcgcgcgg cacaaggccc tgcgtc?ggg cgcgatgglg 1440

catgtgcatg tgcagggcga tgtgctcagt ttcacccgcc atgcgcgcgq coaaaaaata 1500

ctac?cIcS ! Caa r tttC cgccaaaa ~ 9tcgcccaca aggcgcacaa gggtglglcg 1560

aaac^cgca? ^ aatggCgc aa ~ggggcg gtgttgacgc catatggcgc ?c?c?ggacg 1620

1632

<210> 16 
<211> 543 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<221> DOMAIN 
<222> (25)...(415)

<223> Alpha amylase, catalytic domain 
<400> 16 

Met Thr Gln Ser Ser Ser Ala Val Ser Pro Ala Ser Ser Gly Pro Ala

1 5 10 15

Trp Trp His Gly Ala Ser Ile Tyr Gln Ile Tyr Pro Arg Ser Phe Ala

20 25 30

Asp Ser Asn Gly Asp Gly Val Gly Asp Leu Ala Gly Ile Thr Ser His

35 40 45

Leu Glu His Ile Ala Ser Leu Gly Val Glu Ala Ile Trp Ile Ser Pro

50 55 60

Phe Phe Thr Ser Pro Met Ala Asp Tyr Gly Tyr Asp Val Ala Asp Tyr

65 70 75 80

Cys Asp Val Asp Pro Ile Phe Gly Thr Leu Ala Asp Phe Asp Ala Leu

85 90 95

Val Glu Lys Ala His Gly Leu Gly Leu Lys Val Thr Ile Asp Met Val

100 105 110

Phe Ala His Thr Ser Asp Arg His Pro Trp Phe Glu Gln Ser Arg Ser

115 120 125

Ala Arg Glu Asn Asp Arg Ala Asp Trp Tyr Val Trp Ala Asp Pro Lys

130 135 140

Pro Asp Gly Thr Pro Pro Asn Asn Trp Gln Ser Val Phe Gly Gly Pro

145 150 155 160

Ala Trp Thr Trp Asp Ala Arg Arg Gly Gln Tyr Tyr Met His Gln Phe

165 170 175

Leu Lys Glu Gln Pro Gln Leu Asn Ala His Asn Pro Ala Val Gln Asp

180 185 190

Ala Leu Leu Asp Ala Leu Arg Phe Trp Leu Glu Arg Gly Val Asp Gly

195 200 205

Phe Arg Leu Asp Ala Leu Asn His Ser Met Phe Asp Pro Ala Leu Thr

210 215 220

Asp Asn Leu Pro Ala Pro Glu Asp Gly Lys Ile Arg Thr Arg Pro Phe

225 230 235 240

Asp Phe Gln Leu Lys Ile Asn Ser Gln Asn His Pro Ala Val Thr Leu

245 250 255

Phe Ile Glu Arg Ile Ala Asp Val Cys Gly Gln His Gly Ala Val Phe

260 265 270

Thr Val Ala Glu Val Gly Gly Asp Gly Ala Val Pro Leu Met Lys Ala

275 280 285

Tyr Thr Ala Gly Glu His Arg Leu Ser Ser Ala Tyr Ser Phe Asp Phe

290 295 300

Leu Tyr Ala Pro Ala Leu Thr Gly Glu Leu Val Ala Asn Ala Leu Ala

305 310 315 320

Gln Trp Thr Gly Lys Pro Gly Ala Asp Gly Leu Ser Glu Gly Trp Pro

325 330 335

Ser Trp Ala Phe Glu Asn His Asp Ala Pro Arg His Ile Ser Arg Trp

340 345 350

Val Gly Glu Glu His Arg Ala Ala Phe Ala Arg Met Ser Leu Val Leu

355 360 365

Leu Ala Ser Leu Arg Gly Asn Met Phe Met Tyr Gln Gly Gln Glu Leu

370 375 380

Ala Leu Glu Gln Asp Glu Ile Pro Phe His Leu Leu Lys Asp Pro Glu

385 390 395 400

Ala Ile Ala Asn Trp Pro Leu Thr Leu Ser Arg Asp Gly Val Arg Thr

405 410 415

Pro Met Pro Trp Asp Ser Gln Ala Phe His Ala Gly Phe Thr Ser Gly

420 425 430

Glu Pro Trp Leu Pro Leu Ser Pro Gly Asn Ile Ala Lys Ala Val Asp

435 440 445

Val Gln Glu Ala Asp Pro Gln Ser Gln Leu His Trp Val Arg Arg Val

450 455 460

Leu Ala Leu Arg Ala Arg His Lys Ala Leu Arg Leu Gly Ala Met Glu

465 470 475 480

His Val His Val Gln Gly Asp Val Leu Ser Phe Thr Arg His Ala Arg

485 490 495

Gly Glu Arg Val Glu Cys Val Phe Asn Leu Ser Ala Lys Thr Val Ala

500 505 510

His Lys Ala His Lys Gly Glu Thr Leu Leu Thr Val Asn Gly Ala Thr

515 520 525

Gly Ala Val Leu Thr Pro Tyr Gly Ala Leu Trp Thr Lys Leu Ala

530 535 540

<210> 17 
<211> 1422 
<212> DNA 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 
<400> 17 

atgaagtaca agaaccagtt aaaaccactc atggaagaag agcctcgggg taaaooceta *n 

llTlllT ttatag ^9C tgggagcgcc gtctggagc? c?agga?aT? cata"Scc« lS 

atcctcgcga aaagcctgca tggcgccaaa gtctatctca tggatattaa taagqaqcoa lln 

ctcgacctca taacaggttt cgccaagcga tacgctgccg agltgcatgc tga? c tagag 24? 

tttatcccta caatggatcg catagaggcg ataagggatg cagacttcgt cgtcaac?cf 3o2 

gccatgtacg gtggccacat gtactatgag cgtatgagag aaatatgtga gaggcacaqc l£ 

tattaccgag gaataaacag tgttgaatgg aacatggtca gcgattacca LccaSa An 

ggttactacc agttcaaact agccatgagc attgccaagg atgtagagga a^acgcgccl tlo 
gacgcctggc ttatcaacgt cgccaaccct gtgttcgagc tgacaacact gatccagagg 540
aaaoSw t3aagatgat a ^ctctgc catggctacc atggcatcta t..tgt£S 600 
ggctagacag ^atgagaca gagttcgagg tactcggctt caaccaigtg 660 
atctggctca caaagttcaa gtacagggga gaggacgctt accccttact agataagtgg 720 
atcgaggaga aagcagagaa gtactgggag cattggagac aaacacaggt gaacccgt?c 780 
gacattgact tgtctcctgc agcgatagac atgtacaaga gatacggtct Icttccqqta 
ggagatactg tgcgtggagg cacgtggatg taccactggg atctclagac gaagcagaaa 
c2i£j?£ Cga " ggagg ac cagactcc gagataggct ggatgatgta tatagccttc 
aciSISJ! agctCCaaag at tatacgaa gcactaatgg atcagaagca ccca?tagca 
gcacatatac cgccggagtg gagcggtgaa tccatagtcc caataatcga tagcctcgcc 
aacaatagga ggggagaata cgttatcaac acgttgaacc ttggtagtat accggggata 
ccggatagtg tagctgtcga gatgccagcc cagatagatg gtaaaggagt gcaccg<??ac 
atattcgagc ccctccccaa gaagataaga gacctagtcc tactgcctag gatgacccgt 

a g ? 9 cgttgacagc cttcctcgag ggaggccgcg aagtactcga ggactggcta i 320 
cacatggatc cacgtaccaa gagcactgga caggtacggg agacaatcga tgatctccto 1380 
aacatgcccg gtaacgagga gatgaagaag catttcagct ag ^gatctcctg 1380 

<210> 18 
<211> 473 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<221> DOMAIN 
<222> (21)...(446)

<223> Family 4 glycosyl hydrolase 



<400> 18 

Met Lys Tyr Lys Asn Gln Leu Lys Pro Leu Met Glu Glu Glu Pro Arg

1 5 10 15

Gly Lys Gly Leu Lys Ile Ala Phe Ile Gly Ala Gly Ser Ala Val Trp

20 25 30

Ser Ser Arg Ile Ile Ile Asp Leu Ile Leu Ala Lys Ser Leu His Gly

35 40 45

Ala Lys Val Tyr Leu Met Asp Ile Asn Lys Glu Arg Leu Asp Leu Ile

50 55 60

Thr Gly Phe Ala Lys Arg Tyr Ala Ala Glu Met His Ala Asp Leu Glu

65 70 75 80

Phe Ile Pro Thr Met Asp Arg Ile Glu Ala Ile Arg Asp Ala Asp Phe

85 90 95

Val Val Asn Ser Ala Met Tyr Gly Gly His Met Tyr Tyr Glu Arg Met

100 105 110

Arg Glu Ile Cys Glu Arg His Gly Tyr Tyr Arg Gly Ile Asn Ser Val

115 120 125

Glu Trp Asn Met Val Ser Asp Tyr His Thr Ile Trp Gly Tyr Tyr Gln

130 135 140

Phe Lys Leu Ala Met Ser Ile Ala Lys Asp Val Glu Glu Tyr Ala Pro

145 150 155 160

Asp Ala Trp Leu Ile Asn Val Ala Asn Pro Val Phe Glu Leu Thr Thr

165 170 175

Leu Ile Gln Arg His Val Lys Ile Lys Met Ile Gly Leu Cys His Gly

180 185 190

Tyr His Gly Ile Tyr Asn Val Ile Lys Glu Leu Gly Leu Asp Arg Asp

195 200 205

Glu Thr Glu Phe Glu Val Leu Gly Phe Asn His Val Ile Trp Leu Thr

210 215 220

Lys Phe Lys Tyr Arg Gly Glu Asp Ala Tyr Pro Leu Leu Asp Lys Trp

225 230 235 240

Ile Glu Glu Lys Ala Glu Lys Tyr Trp Glu His Trp Arg Gln Thr Gln

245 250 255

Val Asn Pro Phe Asp Ile Asp Leu Ser Pro Ala Ala Ile Asp Met Tyr

260 265 270

Lys Arg Tyr Gly Leu Leu Pro Val Gly Asp Thr Val Arg Gly Gly Thr

275 280 285

Trp Met Tyr His Trp Asp Leu Lys Thr Lys Gln Lys Trp Tyr Gly Pro

290 295 300

Thr Gly Gly Pro Asp Ser Glu Ile Gly Trp Met Met Tyr Ile Ala Phe

305 310 315 320

Leu Ser Met Gln Leu Gln Arg Leu Tyr Glu Ala Leu Met Asp Gln Lys

325 330 335

His Pro Leu Ala Ala His Ile Pro Pro Glu Trp Ser Gly Glu Ser Ile

340 345 350

Val Pro Ile Ile Asp Ser Leu Ala Asn Asn Arg Arg Gly Glu Tyr Val

355 360 365

Ile Asn Thr Leu Asn Leu Gly Ser Ile Pro Gly Ile Pro Asp Ser Val

370 375 380

Ala Val Glu Met Pro Ala Gln Ile Asp Gly Lys Gly Val His Arg Tyr

385 390 395 400

Ile Phe Glu Pro Leu Pro Lys Lys Ile Arg Asp Leu Val Leu Leu Pro

405 410 415

Arg Met Thr Arg Met Glu Met Ala Leu Thr Ala Phe Leu Glu Gly Gly

420 425 430

Arg Glu Val Leu Glu Asp Trp Leu His Met Asp Pro Arg Thr Lys Ser

435 440 445

Thr Gly Gln Val Arg Glu Thr Ile Asp Asp Leu Leu Asn Met Pro Gly

450 455 460

Asn Glu Glu Met Lys Lys His Phe Ser

465 470

<210> 19
<211> 1686
<212> DNA
<213> Unknown

<220>

<223> Obtained from an environmental sample
<400> 19 



Si Si iSE <~ s=s ™" 

aatggltatg frltalgtga JSSttS? tttlT^ ataaatcacc gcagcggga? 

SSI *™ ga ~" a aa " gga?^ 

ccgtaccgag SSXX a " tgaatgg "tcaagaag ctaaaaaatc gaaagarlac 

tgggtttS? aatctggggg a^Igcg?^ ttaiTT atggaagt * c tccglcgaat 
ctacxtttn* M-nro*.^^ auccgcgtgg gagcatgata acctcacaga acaatcgtat 

ag g g a gta 9 g ItllaltltZ ttgaactg ^ aaaacgagcg tgtgcgccgc 

g?t 9 t 9 aat g taaStcXa agatclgcot IIT^Z? 9 tagatggatt tcgtctgglc 

ggagatggcc ggaaattcta clccgacggt cctcgcg'ac Talllllltt Cgttgcacca 
aatcaagaag ttttttcaaa st^Z^l c ctcgcgtgc acgaatatat gcgggaaatg 
gttgacLc? gtattcagta 222^™ atg3Ccgtt 9 gagaaatgtc gtccacaacc 
tttLtcatt ?gaaag ag ga ttaccJaaa? laT 9 ^ agctta * tat gacgtttaat 
tttattaaat taaaagaga? tc^atclacr S* 3 * 39 * g ^ cccta 9 c agattttgat 
tggaatgcat tatttfgg^g caaccataat It^^" aaat ^ aataa ^ggtggggga 
gacgaactct atcataSaa ~ltl 2£2£ SgHSS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
ggaacgcctt atatctatca aggcgaagaa ataggtatga caaacccgaa gttttcctct 1140

attgatgaat atagagatgt ggagtcatta aatgtgtatg aaataaaacg tgcacaagga 1200 

atggacgaaa atgaaatttt ggaaatttta aaacataaat caagagataa ttcccgtaca 1260 

ccggtgcaat ggaacgataa gccgaatgca ggttttacaa aaggaaagcc atggattcat 1320 

ff?? C f??!f actaccgtaa aattaatgta gaaaaagcgt tagaggataa agattcaatc 1380 

" U tt " atcaaaagct tattgcactg cgcaagcagt acgagattat cacctatgga 1440 

aactatgaat tgattcttgg agaagacgag cagattttcg cttatatccg aaatggagca 1500 

gatgaaaagc tgctcgtgat aaacaatttc tacggcagcg agaaaatttt tgaactgcca 1560 

gaaaatctaa cttttgaagg atatcatagt gaaatattgc tgtctaacta cgaagattca 1620 

ccaaaggaat tcaagcgagt cttacttcgg ccgtatgaat caatcgtgta tcatttaaaa 1680

333 Ldg 1686

<210> 20
<211> 561
<212> PRT
<213> Unknown



<220> 

<223> Obtained from an environmental sample 

<221> DOMAIN 
<222> (13)...(419)

<223> Alpha amylase, catalytic domain 
<400> 20 

Met Gln Glu Pro Trp Trp Lys Lys Ser Val Val Tyr Gln Ile Tyr Pro

1 5 10 15

Lys Ser Phe Tyr Asp Thr Thr Gly Asn Gly Val Gly Asp Ile Ala Gly

20 25 30

Ile Ile Glu Lys Leu Asp Tyr Leu Lys Lys Leu Gly Val Asp Val Leu

35 40 45

Trp Leu Thr Pro Ile Tyr Lys Ser Pro Gln Arg Asp Asn Gly Tyr Asp

50 55 60

Ile Ser Asp Tyr Phe Val Ile Gln Glu Glu Tyr Gly Thr Met Glu Asp

65 70 75 80

Phe Asp Leu Leu Val Thr Glu Ala His Lys Arg Gly Leu Lys Val Ile

85 90 95

Met Asp Ile Val Val Asn His Thr Ser Thr Glu His Glu Trp Phe Gln

100 105 110

Glu Ala Lys Lys Ser Lys Asp Asn Pro Tyr Arg Asp Phe Tyr Ile Trp

115 120 125

Lys Asp Gln Lys Glu Asp Gly Ser Ala Pro Thr Asn Trp Val Ser Lys

130 135 140

Phe Gly Gly Ser Ala Trp Glu His Asp Asn Leu Thr Glu Gln Ser Tyr

145 150 155 160

Leu His Leu Phe Asp Val Thr Gln Ala Asp Leu Asn Trp Glu Asn Glu

165 170 175

Arg Val Arg Arg Ser Val Tyr Asp Met Met Thr Phe Trp Phe Glu Lys

180 185 190

Gly Val Asp Gly Phe Arg Leu Asp Val Ile Asn Leu Ile Ser Lys Asp

195 200 205

Gln Arg Phe Leu Asp Asp Asp Gly Ser Val Ala Pro Gly Asp Gly Arg

210 215 220

Lys Phe Tyr Thr Asp Gly Pro Arg Val His Glu Tyr Met Arg Glu Met

225 230 235 240

Asn Gln Glu Val Phe Ser Lys Tyr Asp Ser Met Thr Val Gly Glu Met

245 250 255

Ser Ser Thr Thr Val Asp His Cys Ile Gln Tyr Ser His Pro Asp Arg

260 265 270

Arg Glu Leu Ser Met Thr Phe Asn Phe His His Leu Lys Val Asp Tyr

275 280 285

Pro Asn Gly Glu Lys Trp Ala Leu Ala Asp Phe Asp Phe Ile Lys Leu

290 295 300

305 TrP Gln Thr G1U Met Asn G1 V Gly Gly 

Trp Asn Ala Leu Phe Trp Cys Asn His Asp Gln Pro Arg Val Val Ser

325 330 335

Arg Tyr Gly Asp Asp Glu Leu Tyr His Asn Lys Ser Ala Lys Met Leu

340 345 350

Ala Thr Thr Ile His Leu Met Gln Gly Thr Pro Tyr Ile Tyr Gln Gly

355 360 365

Glu Glu Ile Gly Met Thr Asn Pro Lys Phe Ser Ser Ile Asp Glu Tyr

370 375 380

Arg Asp Val Glu Ser Leu Asn Val Tyr Glu Ile Lys Arg Ala Gln Gly

385 390 395 400

Met Asp Glu Asn Glu Ile Leu Glu Ile Leu Lys His Lys Ser Arg Asp

405 410 415

Asn Ser Arg Thr Pro Val Gln Trp Asn Asp Lys Pro Asn Ala Gly Phe

420 425 430

Thr Lys Gly Lys Pro Trp Ile His Pro Ala Asp Asn Tyr Arg Lys Ile

435 440 445

Asn Val Glu Lys Ala Leu Glu Asp Lys Asp Ser Ile Phe Tyr Phe Tyr

450 455 460

Gln Lys Leu Ile Ala Leu Arg Lys Gln Tyr Glu Ile Ile Thr Tyr Gly

465 470 475 480

Asn Tyr Glu Leu Ile Leu Gly Glu Asp Glu Gln Ile Phe Ala Tyr Ile

485 490 495

Arg Asn Gly Ala Asp Glu Lys Leu Leu Val Ile Asn Asn Phe Tyr Gly

500 505 510

Ser Glu Lys Ile Phe Glu Leu Pro Glu Asn Leu Thr Phe Glu Gly Tyr

515 520 525

His Ser Glu Ile Leu Leu Ser Asn Tyr Glu Asp Ser Pro Lys Glu Phe

530 535 540

Lys Arg Val Leu Leu Arg Pro Tyr Glu Ser Ile Val Tyr His Leu Lys

545 550 555 560

Lys



<210> 21 
<211> 1443 
<212> DNA 
<213> Eukaryote 

<400> 21 



tgaagatcgg tatcatcgga gcaggaagtg ccgttttctc actgaggctc 60

tttgcaaaac tcccggtctt tccggaagta ccgttaccct catggacatc 120

gacgaagaaa ggctagatgc ggttctgacc atagcaaaaa agtacgttga agaagtoaoa 180

Ittllttllt agtttg f aaa Sacaacaagc gtagacgaag cLtcgcigJ Jgcgga???? 240

gtgataaaca cagcgatggt gggtggccac acctatcttg aaaaggtcag aaggatcaac 300

gaaaagtacg gatactacag aggaatagac gcgcaggag? tcaaSgg? ctclgactac 360

tacacgtttt caaactacaa ccagctcaag tacttcgtgg atatcgcaag gaaalSaa 420

agactctctc caaaggcgtg gtatctgcag gcggcaaa?c ctgtcftJga fggaacafc? 480

cttgtgacaa gaacggttcc cataaaggca gtgggattct gclatggaca c?acggcgtg 540

atggagatcg tagagaaact ggggctggaa gaaaacaggg ?agac?ggca gg^gclgg? 600

gtgaaccacg gaatatggtt gaacaggttc agatacaacg gcgagga?gc lllalttllt 660

ctagatagat ggatcagtga aaaatcgaaa gactggaaa? Lgagfac^c Scaacgac 720

a^coto™ ITTT agaCat ^ tac -gattctacg gtgtgatgcc ca^cggtgal 780

accgtgagaa actcttcgtg gaggtaccac agggatcttg agaccaagaa gaaa?gg?ac 840
~1 !IEH SHi «= ss=s

gaagtggaaa aqattctcoa tS*„,„? aaggatcttt cggaaaaaca gttcgtcctt 
gatgcccta? tllllalrtt * CCtgaaaaa ^^gagcggtg aacagcacat tcctttcatc 

S35S Si ~? SSSS SSSK 

S2ss asbs Si' ~~ ssss sssss 

jtj^iJ .cSSS SSS 



<210> 22 
<211> 480 
<212> PRT 
<213> Eukaryote 

<220> 

<221> DOMAIN 
<222> (5)...(451)

<223> Family 4 glycosyl hydrolase 



<400> 22

Met Pro Ala Val Lys Ile Gly Ile Ile Gly Ala Gly Ser Ala Val Phe

1 5 10 15

Ser Leu Arg Leu Val Ser Asp Leu Cys Lys Thr Pro Gly Leu Ser Gly

20 25 30

Ser Thr Val Thr Leu Met Asp Ile Asp Glu Glu Arg Leu Asp Ala Val

35 40 45

Leu Thr Ile Ala Lys Lys Tyr Val Glu Glu Val Gly Ala Asp Leu Lys

50 55 60

Phe Glu Lys Thr Thr Ser Val Asp Glu Ala Ile Ala Asp Ala Asp Phe

65 70 75 80

Val Ile Asn Thr Ala Met Val Gly Gly His Thr Tyr Leu Glu Lys Val

85 90 95

Arg Arg Ile Ser Glu Lys Tyr Gly Tyr Tyr Arg Gly Ile Asp Ala Gln

100 105 110

Glu Phe Asn Met Val Ser Asp Tyr Tyr Thr Phe Ser Asn Tyr Asn Gln

115 120 125

Leu Lys Tyr Phe Val Asp Ile Ala Arg Lys Ile Glu Arg Leu Ser Pro

130 135 140

Lys Ala Trp Tyr Leu Gln Ala Ala Asn Pro Val Phe Glu Gly Thr Thr

145 150 155 160

Leu Val Thr Arg Thr Val Pro Ile Lys Ala Val Gly Phe Cys His Gly

165 170 175

His Tyr Gly Val Met Glu Ile Val Glu Lys Leu Gly Leu Glu Glu Asn

180 185 190

Arg Val Asp Trp Gln Val Ala Gly Val Asn His Gly Ile Trp Leu Asn

195 200 205

Arg Phe Arg Tyr Asn Gly Glu Asp Ala Tyr Pro Leu Leu Asp Arg Trp

210 215 220

Ile Ser Glu Lys Ser Lys Asp Trp Lys Pro Glu Asn Pro Phe Asn Asp

225 230 235 240

Gln Leu Ser Pro Ala Ala Ile Asp Met Tyr Arg Phe Tyr Gly Val Met

245 250 255

Pro Ile Gly Asp Thr Val Arg Asn Ser Ser Trp Arg Tyr His Arg Asp

260 265 270

Leu Glu Thr Lys Lys Lys Trp Tyr Gly Glu Pro Trp Gly Gly Ala Asp

275 280 285
Ser Glu Ile Gly Trp Lys Trp Tyr Gln Asp Thr Leu Gly Lys Val Thr

290 295 300

Glu Ile Thr Lys Lys Val Ala Lys Phe Ile Lys Glu Asn Pro Ser Ala

305 310 315 320

Arg Leu Ser Asp Leu Gly Ser Val Leu Gly Lys Asp Leu Ser Glu Lys

325 330 335

Gln Phe Val Leu Glu Val Glu Lys Ile Leu Asp Pro Glu Lys Lys Ser

340 345 350

Gly Glu Gln His Ile Pro Phe Ile Asp Ala Leu Leu Asn Asp Asn Arg

355 360 365

Ser Arg Phe Val Ile Asn Ile Pro Asn Lys Gly Ile Ile Gln Gly Ile

370 375 380

Asp Asp Asp Val Val Val Glu Val Pro Ala Val Val Asp Arg Asp Gly

385 390 395 400

Ile His Pro Glu Lys Ile Ala Pro Pro Leu Pro Glu Arg Val Val Lys

405 410 415

Tyr Tyr Leu Arg Pro Arg Ile Met Arg Met Glu Met Ala Leu Glu Ala

420 425 430

Phe Leu Thr Gly Asp Ile Arg Ile Ile Lys Glu Val Leu Tyr Arg Asp

435 440 445

Pro Arg Thr Lys Ser Asp Glu Gln Val Glu Lys Val Ile Glu Glu Ile

450 455 460

Leu Ser Leu Pro Glu Asn Glu Glu Met Arg Lys Asn Tyr Leu Lys Lys

465 470 475 480

<210> 23
<211> 1686
<212> DNA
<213> Unknown

<220>

<223> Obtained from an environmental sample

<400> 23

atgcaagaac cttggtggaa aaaatcggtt gtttatcaaa tttatccaaa aagtttttat 60

gatacgactg gaaacggcgt gggtgatata gctggcatta ttgaaaagct agattatttg 120

aaaaagctcg gggtagatgt tctgtggcta acgccaattt ataaatcacc gcagcgggat 180

aatggatatg atataagtga ttattttgtt attcaagaag aatacggaac aatggaggat 240

tttgatcttt tagtaacaga agcgcataag cggggtctta aagtcatcat ggatattgtc 300

gttaatcata catcaactga acatgaatgg tttcaagaag ctaaaaaatc gaaagataac 360

ccgtaccgag atttttatat ttggaaagat caaaaagaag atggaagtgc tccgacgaat 420

tgggtttcaa aatttggggg atccgcgtgg gagcatgata acctcacaga acaatcgtat 480

ctgcatttgt ttgatgttac gcaagcggat ttgaactggg aaaacgagcg tgtgcgccgc 540

agcgtgtatg atatgatgac gttttggttt gaaaaaggag tagatggatt tcgtctggac 600

gttattaatt taatttcaaa agatcagcgt tttttagatg atgacggttc cgttgcacca 660

ggagatggcc ggaaattcta caccgacggt cctcgcgtgc acgaatatat gcgggaaatg 720

aatcaagaag ttttttcaaa atatgatagt atgaccgttg gagaaatgtc gtccacaacc 780

gttgaccact gtattcagta ctctcatccg gaccgacgcg agcttagtat gacgtttaat 840

tttcatcatt tgaaagtcga ttacccgaat ggagaaaagt gggccctagc agattttgat 900

tttattaaat taaaagagat tctatcaact tggcaaacgg aaatgaataa aggtggggga 960

tggaatgcat tattttggtg caaccatgat cagcctcgcg ttgtttcacg ctatggagac 1020

gacgaactct atcataataa atctgctaaa atgctcgcta caacgattca tttgatgcag 1080

ggaacgcctt atatttatca aggcgaagaa ataggtatga caaacccgaa gttttcctct 1140

attgatgaat atagagatgt ggagtcatta aatgtgtatg aaataaaacg tgcacaagga 1200

atggacgaaa atgaaatttt ggaaatttta aaacataaat caagagataa ttcccgtaca 1260

ccggtgcaat ggaacgataa gccaaatgca ggttttacaa aaggaaagcc atggattcat 1320

ccggccgata actaccgtaa aattaatgta gaaaaagcgt tagaggataa agattcaatc 1380

ttttattttt atcaaaagct tattgcactg cgcaagcagt acgagattat cacctatgga 1440

aactatgaat tgattcttgg agaagacgag cagattttcg cttatatccg aaatggagca 1500
tgctcgtgat aaacaatttc tacggcagcg aaacagcctt tgaactgcca 1560
actltnnl*? atatcatagt gaaatattgc tgtctaacta cgaagattca 1620 

ccaaaggaat tcaagcgagt cttacttcgg ccgtatgaat caatcgtgta tcatttaaaa 1680



<210> 24 
<211> 561 
<212> PRT 
<213> Unknown 

<220> 

<223> Obtained from an environmental sample 

<221> DOMAIN 
<222> (13)...(419)

<223> Alpha amylase, catalytic domain 



<400> 24

Met Gln Glu Pro Trp Trp Lys Lys Ser Val Val Tyr Gln Ile Tyr Pro

1 5 10 15

Lys Ser Phe Tyr Asp Thr Thr Gly Asn Gly Val Gly Asp Ile Ala Gly

20 25 30

Ile Ile Glu Lys Leu Asp Tyr Leu Lys Lys Leu Gly Val Asp Val Leu

35 40 45

Trp Leu Thr Pro Ile Tyr Lys Ser Pro Gln Arg Asp Asn Gly Tyr Asp

50 55 60

Ile Ser Asp Tyr Phe Val Ile Gln Glu Glu Tyr Gly Thr Met Glu Asp

65 70 75 80

Phe Asp Leu Leu Val Thr Glu Ala His Lys Arg Gly Leu Lys Val Ile

85 90 95

Met Asp Ile Val Val Asn His Thr Ser Thr Glu His Glu Trp Phe Gln

100 105 110

Glu Ala Lys Lys Ser Lys Asp Asn Pro Tyr Arg Asp Phe Tyr Ile Trp

115 120 125

Lys Asp Gln Lys Glu Asp Gly Ser Ala Pro Thr Asn Trp Val Ser Lys

130 135 140

Phe Gly Gly Ser Ala Trp Glu His Asp Asn Leu Thr Glu Gln Ser Tyr

145 150 155 160

Leu His Leu Phe Asp Val Thr Gln Ala Asp Leu Asn Trp Glu Asn Glu

165 170 175

Arg Val Arg Arg Ser Val Tyr Asp Met Met Thr Phe Trp Phe Glu Lys

180 185 190

Gly Val Asp Gly Phe Arg Leu Asp Val Ile Asn Leu Ile Ser Lys Asp

195 200 205

Gln Arg Phe Leu Asp Asp Asp Gly Ser Val Ala Pro Gly Asp Gly Arg

210 215 220

Lys Phe Tyr Thr Asp Gly Pro Arg Val His Glu Tyr Met Arg Glu Met

225 230 235 240

Asn Gln Glu Val Phe Ser Lys Tyr Asp Ser Met Thr Val Gly Glu Met

245 250 255

Ser Ser Thr Thr Val Asp His Cys Ile Gln Tyr Ser His Pro Asp Arg

260 265 270

Arg Glu Leu Ser Met Thr Phe Asn Phe His His Leu Lys Val Asp Tyr

275 280 285

Pro Asn Gly Glu Lys Trp Ala Leu Ala Asp Phe Asp Phe Ile Lys Leu

290 295 300

Lys Glu Ile Leu Ser Thr Trp Gln Thr Glu Met Asn Lys Gly Gly Gly

305 310 315 320

Trp Asn Ala Leu Phe Trp Cys Asn His Asp Gln Pro Arg Val Val Ser

325 330 335

Arg Tyr Gly Asp Asp Glu Leu Tyr His Asn Lys Ser Ala Lys Met Leu

340 345 350

Ala Thr Thr Ile His Leu Met Gln Gly Thr Pro Tyr Ile Tyr Gln Gly

355 360 365

Glu Glu Ile Gly Met Thr Asn Pro Lys Phe Ser Ser Ile Asp Glu Tyr

370 375 380

Arg Asp Val Glu Ser Leu Asn Val Tyr Glu Ile Lys Arg Ala Gln Gly

385 390 395 400

Met Asp Glu Asn Glu Ile Leu Glu Ile Leu Lys His Lys Ser Arg Asp

405 410 415

Asn Ser Arg Thr Pro Val Gln Trp Asn Asp Lys Pro Asn Ala Gly Phe

420 425 430

Thr Lys Gly Lys Pro Trp Ile His Pro Ala Asp Asn Tyr Arg Lys Ile

435 440 445

Asn Val Glu Lys Ala Leu Glu Asp Lys Asp Ser Ile Phe Tyr Phe Tyr

450 455 460

Gln Lys Leu Ile Ala Leu Arg Lys Gln Tyr Glu Ile Ile Thr Tyr Gly

465 470 475 480

Asn Tyr Glu Leu Ile Leu Gly Glu Asp Glu Gln Ile Phe Ala Tyr Ile

485 490 495

Arg Asn Gly Ala Asp Glu Lys Leu Leu Val Ile Asn Asn Phe Tyr Gly

500 505 510

Ser Glu Thr Ala Phe Glu Leu Pro Glu Asn Leu Thr Phe Glu Gly Tyr

515 520 525

His Ser Glu Ile Leu Leu Ser Asn Tyr Glu Asp Ser Pro Lys Glu Phe

530 535 540

Lys Arg Val Leu Leu Arg Pro Tyr Glu Ser Ile Val Tyr His Leu Lys

545 550 555 560

Lys